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Abstract. We define a hierarchy of circuit complexity classes LD', 
whose depth are the inverse of a function in Ackermann hierarchy. 
Then we introduce extremely weak versions of length induction called 
L^ m 'IND and construct a bounded arithmetic theory L' 2 whose prov- 
ably total functions exactly corresponds to functions computable by LD 1 
circuits. Finally, we prove a non-conservation result between L 2 an d a 
weaker theory AC°CA which corresponds to AC . Our proof utilizes 
KPT witnessing theorem. 



1. Introduction 

A fundamental problem of boolean circuit complexity is whether the 
NC/AC hierarchy collapses. And the only known separation is that AC ^ 
NC 1 . In words, there is a gap between constant depth and logarithmic 
depth. This separation was done by Furst Saxe and Sipser || and Hastad 
H who showed that the parity function is not in AC (see also Ajtai [0]). 
Although there are some NC 1 functions which are not in AC , not much is 
known about the gap between these two classes. 

In this note we define a hierarchy which lies between AC and AC 1 . Let 
log( m ) x = log(log(- • • log a;)) (m times) and define the class LD % to be those 

which are computable by polynomial size ( log w n ) depth circuits. Then 



AC C • • • C LD 3 C LD 2 C AC 1 C LD l 

holds. Natural questions to ask about the LD hierarchy are whether LD 1 = 
LDi for some i ^ j, and even whether AC = LD 1 or LD 1 = AC 1 for some i. 
Furthermore, there lies several other interesting classes such as ALOGTIME, 
LOGSPACE or NLOGSPACE along this hierarchy, and inclusion between 
these classes and LD % also seems to be open. 
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In order to attack these problems we shall give a proof theoretical coun- 
terpart to the LD hierarchy. In S. Buss defined weak system of arithmetic 
whose provably total functions correspond to the polynomial hierarchy. His 
theory 5^ utilizes weak form of induction scheme which is called length in- 
duction (LIND). By modifying the length induction scheme in much weaker 
form, we can characterize theories which correspond to the LD hierarchy. 

The depth-bounding function log*-" 1 ) n is known to be an inverse of a func- 
tion in the Ackermann hierarchy, (cf. ||) so it is a very slow growing func- 
tion. Nevertheless, in the last section we show that their proof-theoretical 
counterparts are distinct. Namely, we shall use KPT witnessing theorem to 
separate our theory L\ from the theory AC°CA whose definable functions 
are those in AC . Although our result cannot be used to separate the corre- 
sponding complexity classes LD 1 and AC , it is strong enough an evidence 
to conjecture that AC ^ LD 1 . 

2. Basic Definitions 

Throughout this note we assume that all logarithms are base 2. Define 
the function log^ m ^ x for m G to and \x\ m by 

log^ x = log x, 

lo g (™+i) x = i og (i gM 7 

and 

|x|i = [log (a; + l)], 

l-^lm+l — 1 1 37 1 7TT, I . 

Also we denote |x|i by |x| which is equal to the length of binary represen- 
tation of x. 

We treat functions and sets of both natural numbers and binary strings. 
Numbers are often identified with binary strings by considering their binary 
expansions and conversely, binary strings are identified with corresponding 
natural numbers. The set of binary strings is denoted by {0, 1}* and binary 
strings with length n by {0, l} n . For a natural number x let \x\ be its length 
in binary. For any complexity class C we mean a class of functions and sets 
(predicates) are identified with their characteristic functions. 

As usual, a circuit is a directed acyclic graph with each node labeled by 
either x\,X2, • • • ,x n , A, V, ->. Internal nodes are called gates and labeled 
by either A, V, ->. Nodes without input edges are called input and labeled 
by one of x%,X2, ■ ■ ■ ,x n . The size of a circuit is the number of gates and 
the depth is the length of the longest path in it. The fan-in of a gate is the 
number of input edges and the fan- in of the circuit is the maximum of fan-in 
of gates in it. 

We assume that every circuit has only one output so that it computes 
a predicate. We say that a circuit family C\,... , C m computes a function 
/ : {0, l} n — > {0, l} m if its bitgraph is computed by each circuit Cj. Or 
equivalently, putting all Cj's altogether yields a multi-output circuit that 
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computes /. Hence we can assume that any finite function / : {0, 1}™ — > 
{0, l} m is computed by a single circuit. 

Furthermore, we assume that all circuit families are Ue* -uniform. 

Definition 1. A function f : {0, 1} + — > {0, 1} + is computed by a circuit 
family {C n } nga , if for all n € u>, f\ n (f restricted to the set {0,1}™,) is 
computed by C n . 

First we give a precise definition of the class LD 1 . 

Definition 2. LD 1 is the class of functions computed by some size, 

(log w nl depth circuits of unbounded fan-in. 

remark 1. LD 1 = NC. 

Let AC 1 be the class of functions computable by some polynomial size, 
0((logn) 1 ) depth circuits. Then 

Proposition 1. For i > 2, AC C LD i C AC 1 . 

remark 2. Since AC ^ AC 1 , at least either one of AC ^ LD 1 or LD 1 / 
AC 1 holds for any i G u>. 

We define a function algebra for LD 1 as follows: 

Definition 3. INITIAL is the finite set of functions which consists of: 
Z(x) = 0, PJ}{x\,... ,x n ) = Xk, sq(x) = 2x, si(x) = 2x + 1, \x\ = 
\log(x + l)] , Bit(x, i) = [x/2*J mod 2, x#y = 2^-^ . 

Definition 4. 1. A function f is defined by Concatenation Recursion on 
Notation ( CRN) from g, ho and h\ if 

f(0,y) =g(y), 
f(so(x),y) = s ho{x ^{f(x,y)), if x 
f(si(x),y) = s hl(jB! ff ) (f(x,y)) 

provided that hi(x, y) < 1 for all x and y and i = 0, 1. 
2. Let i G to. A function f is defined by i-Weak Bounded Recursion on 
Notation (W l BRN ) from g, ho, hi and k if 

F(0,y) =g(y), 

F(so(x),y) = h (x,y,F(x,y)), if x / 

F{si(x),y) = hi(x,y,F(x,y)) 

f(x,y) =F(\x\i,y), 

provided that F(x,y) < k(x,y) for all x, y. 

In |l(| it is proved that 

Definition 5. LD 1 is the smallest class of functions which contains INI- 
TIAL and closed under composition CRN and W' l BRN operations. 
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Clote and Takeuti [11 showed that 

Theorem 1. AC is the class of functions containing INITIAL and closed 
under composition and CRN operations. 

We may use a weak successor- type recursion within LD l . 

Proposition 2. LD l is closed under the following recursion operation: 

F(0,y) = g(y), 

F(x + l,y) = h(x,y,F(x,y), 

f(x,y) = F(\x\ i+1 ,y), 

where F(x,y) < k(x,y). 

Since it is unknown whether the majority function belongs to any of LD l 
for i > 1, we can extend the class LD Z by adding majority gates as in the 
definition of TC°. 

Definition 6. For i > 1, MD l is defined as LD l but with additional ma- 
jority gates. 

Then the characterization of LD % are modified as follows: 

Theorem 2. For i > 1, MD l is the smallest class of functions contain- 
ing INITIAL and multiplication and closed under composition, CRN and 
W l+1 BRN operations. 

3. The Theory L\ 

We assume that readers are familiar with basic notions of bounded arith- 
metic. The first order language Li consists of function symbols 

Z(x) = 0,P£(xi, . . . ,x n ) = x k , s (x) = 2x, si(x) = 2x + 1 
[x/2\,MSP(x,i) = [x/2 l \,\x\ = [log 2 (x + l)],x#y = 2N'M 

and a predicate symbol <. 

A quantifier is called bounded if it is either of the form Vx < t or 3x < t 
and sharply bounded if it is either of the form \/x < \t\ or 3x < \t\. A 
formula is bounded if all quantifiers are bounded and sharply bounded if all 
quantifiers are sharply bounded. Sq is the set of sharply bounded formulae. 
T,\ is the set of formulae in which all non-sharply bounded quantifiers are 
positive appearances of existential quantifiers and negative appearances of 
universal ones. IT^ is defined in the same way by replacing existential with 
umver sal. and D$ {i > 2) are defined in an analogous manner. 

Now let us state axioms that consist our theory L\. 

BASIC is a finite set of axioms which define symbols in Li. Let $ be a 
set of formulae. 

• $-Bit-Comprehension: 

By < 2 |t| Vi < \t\ [Bit(i,x) = I <-> (p(i)} , 
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• $-LIND: 

<p(0) A yx{ip(x) -> ip(x + 1)) -> Vx^(|x|), 

• <S>-I)IND (for i > 2): 

(/j(0) A Vx((p(x) — > 92(2; + 1)) — > Vx(/3(|x|j), 

• $-L*MAX: 

3x < acp(x) — > 3x < a (<p(x) A \/y < a(\x\i < \y\i — > -><p{y))) , 
where 93 £ 

Our characterization is similar to that of Clote and Takeuti § who in- 
troduced the notion of essentially sharply boundedness. 

Definition 7. Let T be a theory. A formula ip is essentially sharply bounded 
(esb) in T if it belongs to the smallest class C satisfying the following con- 
ditions: 

• every atomic formula is in C . 

• C is closed under boolean connectives and sharply bounded quantifica- 
tions. 

• If cpo, (fx & C and 

T h 3x < s(a)ipo(a,x) 

T h Vx,y < s{a)(ip (a,x) A (po(a,y) -> x = y) 

then 3x < s(a)((po(a, x) A ipi(a, x)) and Vx < s(a)((fo(a, x) — > </?i(a, x)) 
are in C . 

A formula is epH\ in T if it is of the form 3x\ < t\ ■ ■ ■ 3x k < tk(p(xi, . . . , x k ) 
where ip is esb in T. 

Definition 8. L\ is the Li theory which consists of the following axioms: 

• BASIC 

• Bit-Comprehension for esb formulae with respect to L\ 

• esb-LIND for esb formulae with respect to L\ 

• epT,\-L i+1 IND. 

The following fact will be used later. 

Proposition 3. epY<\-L l+1 MAX principle is a theorem of L l 2 . 

Proof. It is a well-known fact that E^-LIND and LT^-LIND are equivalent 
over BASIC+open-LIND and its proof is directly applied to show that 

L* 2 h epT![-L i+1 IND «-> epU b r L i+1 IND, 

where epll\ is the set of formulae of the form 

Vxi < t\ ■ ■ ■ Mx k < t k ip{xx, ... ,x k ) 

for esb formula ip in L\. It is also easy to see in L\ that epH\-L i+1 IND 
implies epT\-L i+1 MAX . " □ 

In the author defined a theory for AC functions. 



6 SATORU KURODA TOYOTA NATIONAL COLLEGE OF TECHNOLOGY 

Definition 9. The language L AC o consists of function symbols for all AC 
functions. AC°CA is the L AC o theory which consists of the following ax- 
ioms: 

• defining axioms for each f € AC , 

• Tjq-LIND. 

Theorem 3. AC°CA is an universal theory. 

It is also seen that AC°CA can be regarded as a subtheory of L\ since 

Proposition 4. Let f be a function symbol for an AC function and let 
L 2 (f) be the theory L\ extended by the symbol f together with its defining 
axioms. Then L 2 (f) is an conservative extension of L\. 

Hence in the following we assume that L\ has all AC functions in its 
language. 

Definition 10. A function f is esb definable in a theory T if there exists 
an esb formula ip in T such that 

T h Vx3y(p(x,y), 

T h Vz, y, zcp(x, z) A tp(y, z) -> x = y, 

The following immediately holds by the definition. 

Proposition 5. Let f(x,y) esb-define a function f in a theory T. Then 
the following formulae are equivalent in T(f) 

• 3x < s(a)(ip(a, x) A ip(a, x)) 

• Vx < s(a)(<f(a,x) -^il){a,x)) 

• ip(a,f(a)). 

Hence esb formulae are with respect to the theory in concern and 
sharply bounded in the extended language. 

4. Definability of LD % Functions in L\ 
First we shall show that LD l functions can be defined in L\. 
Theorem 4. If f G LD l then f is epH\ definable in L\. 

Proof. The proof is by induction on the complexity of / € LD l . 

By the assumption that L\ contains AC°CA, all INITIAL functions are 
epT,\ definable (actually esb definable) in AC°CA , hence also in L\. The 
same argument implies that the closure under composition and CRN are also 
proved within AC°CA. So it suffices to show that Y,\ definable functions of 
Lr, are closed under W l BRN operation. 

Let / be defined by W l BRN from g, ho, hi and k each has T>\ definition 
in L\. Let $(w) be the formula expressing that u w is a sequence of the 
computation of /" . Then it is readily seen that $ G epY,\ and 

4 h $(o) a vx($(H) + 1)). 
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So by Y\-L i+l IND we have L\ h Vx$(H m ). Hence the formula $ 
defines / provably in L\. □ 

5. Witnessing Proofs in L 2 

Now we shall show the converse to the previous theorem. Namely, All esb 
consequences of L % 2 are witnessed by some LD l functions. 

Theorem 5. Let ip € epT\ be such that L 2 \~ \/x3ytp(x,y). Then there 
exists a function f € LD l such that L\ h Vxip(x, f(x)). 

The proof is by the witnessing method. 

Definition 11. Let p be an esb formula in T . Then we denote the equiv- 
alent sharply bounded formula (in the extended language) by p sb (called sb 
version of p). If ip is epT\ of the form 3x\ < t% ■ ■ ■ 3x k < t k p(x\, . . . , x^) 
where <p> is esb then (p sb denotes the formula 

3xi < h ■ ■ ■ 3x k < t k ip sb (xi, ... , x k ). 

For sequents and inference rules, their sb versions are defined analogously. 

Theorem [| is a corollary to the following theorem. 

Theorem 6. Let T — > A be provable in L\ and T sb — > A sb be of the form 

3x 1 < siAf(a, x)A • • • A 3x m < s m A s £(a, x) 

3yi < tiB{ b {a, x) V • • • V 3y n < t n B sb {a, x) 

where A\,... , A m , Bi . . . ,B n are sharply bounded. Then there exist func- 
tions /i, • • ■ , f n G LD 1 such that 

h < si(o) Af{a,x) A ••• Ab m < s m (a)A s £(a,x) 

-» fl(a, b) < h(a)Bf(a, f(a, b)) V • • • V f n (a, b) < t n (a)B s n b (a, b), 

where b = b\ , . . . ,b m . 

Proof. Induction on the number of sequences in the L^-proof of the sequent 
r — > A. The proof is divided into cases according to the last inference of 
the -L2"P ro °f- It suffices to show that the last inference I is epY,\-L l+1 IND 
since other cases are identical to that of Theorem 4.3 in Ji. Let I sb be of 
the form 

3x < s(a)<p(b, a, x) — ► 3y < t(a)ip(a, y), 3x < s(a)(p(b + 1, a, x) 

3x < s(a)(p(Q,a,x) — > 3y < t(a)tp(a,y),3x < s(a)ip(\t'\i + i, a, x) 

where (p(b,a,c) and ip(a,d) are sharply bounded in the extended language 
that includes Skolem functions for esb formulae. By the induction hypothesis 
we have LD l functions g and h which witnesses the upper sequent, namely, 

c < s(a), ip(b, a, c) — > g(a, b, c) < t(a) A tp(a, g(a, b, c)), 

h(a, b, c) < s(a) A p{b + 1, a, h(a, b, c)), 
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Now define 

G'(a, c) = fix < \t'\ i+ x (g(a, x, c) < t(a) A ip(a, g(a, x, c))) 

and G(a,c) = g(a,G'(a,c),c). Since it is easily seen that the ^-operator of 
the form fix < can be computed by LD l , G' and hence G are in LD l . 

Define / by W i+1 BRN as, 

F(a, c, 0) = min(c, s(a)) 

F(a, c, n + 1) = min(c, /i(a, m, F(a, c, n))) 

f(a,c) = F(a,c,\t'\ i+1 ) 

Then / € LD* and 

c < s(a), y?(0, a, c) — > g'(a, c) < t(a) A ^(a, </(a, c)), 

/(a, c) < s(a) A (p(\t'\ i+1 , a, /(a, c)) 

holds. So we are done. □ 

Proof, [of Theorem ||] Let (p be esb in L\. Let T be an empty cedent and 
A = By < t(p(x,y). Then applying Theorem |B] to the sequent r — > A we get 
the claim. □ 

6. A NON-CONSERVATION RESULT FOR L\ 

In this section we shall show that for any i € u, the theory L\ is strictly 
stronger than the weaker theory AC°CA. The proof is based on KPT wit- 
nessing theory by Krajicek Pudlak and Takeuti Q which utilizes Herbrand's 
theorem for universal theories. 

First we introduce a computation principle that realizes the counter- 
example computation. 

Definition 12. Let R(x, y) be a binary predicate. Then define the predicate 
R (i) (x,y,z) 

\y\i < \x\i A (y > o R{ x > y)) A (\y\% < \ z \i < \ x \% ^R(%, *))■ 

The computation principle £l l (AC°) is the following: 

For any R(x, y) £ AC there exists a finite number of functions /i, ■ ■ ■ , fk G 
AC swc/i that 

either VzR® (a, fi (a) , z) is true, 

or if b\ is such that -ii?W(a, fi(a), b\) 
then either VzR( l \a, f2(a,bi), z) is true, 

or if hi is such that -\RW(a, fi{a, b\), 62) 

then ■ ■ ■ 

then VzR®(a, /fe(a, 61, • • • , 0fc-i, -2)) is true. 
By generalizing the proof in |J, the following holds, 
Proposition 6. For any i £ u, £l l (AC°) implies NP C AC /poly. 
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Proof. For simplicity, we will show for the case where i = 3. The general 
case can be treated similarly. Let A(x) = Vw < xB(x, w) be an NP-complete 
predicate via AC reduction where B G AC . Then it suffices to show that 
there exists a function g £ AC and a polynomial growth function h(n) such 
that 

A(x) ^ B(x,g(x,h(\x\))) 

holds. We say that w witnesses x if w < x A B(x,w) holds. Consider the 
formula 

R({ai, . . . , a m ), (wi, . . . , w n )) -£4> m < n A Vi < m(u/j witnesses cij). 

Then By Q S (AC°) there exists a finite sequences of functions fi, ■ ■ ■ , fk € 
AC which interactively compute a maximal number of witnesses w\, . . . ,w u 
for some initial segment of a\, . . . , a m . 

Fix n € N and let Vi = {x : |x| = n}. Let Z = 2 2 . First we shall con- 
struct an algorithm which computes a pair (j, w) on input a = (x\, . . . , X&) G 
V^.' such that t« witnesses Xj. The algorithm works as follows: 

begin 

y = fi(a) 

if Zen(y) > 1 and R(a, y) then 

output and halt 

endif 

for i := 2 to k do 

y := fj(a,bi,... ,bj - 1) 

if len(y) > j and i?(a, y) then 

^ := <(y) 2 aJ-a+i,--- ,(l/) 2 2i-0 
output and halt 

endfor 
end. 

Let Q C Vi be such that |Q| = Z — 1 and v £ V\ \Q. We say that Q helps v 
if for some ordering a = (x\, . . . , Xj-i,v, Xj + i, . . . , x\) of Q U {t> }, the above 
algorithm computes a witness (j, u;) for v. Since Z is a constant value, it is 
straightforward to see that there exists an AC algorithm to check on input 
Q and v whether Q helps v. Furthermore, such algorithm can compute a 
witness w(Q, v) for v if exists. 

Now we shall show that with the help of polynomially many sets Qi, Q2, ■ ■ ■ 
each with size I — I, we can compute a witness for given a G Vi, thus we can 
construct an AC algorithm which takes a and polynomial advice Qi, Q2, ■ ■ ■ 
as input and compute a witness for a. 

Let |Vi| = TV. Note that there are at least (ff) pairs of (Q,v) such that 
Q helps v while there are at most G^J many sets Q of size I — 1. So there 

exists a set Q± which helps at least N ~} +1 different elements of V\ \Qi- 
Define the set Vi inductively as, 

Vi+i = {v e Vi : Qi does not help v} . 
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Then in general, there exists Qi+i Q Vj+i which helps at least ' +1 
many elements of V^+i \ Qi+i- Let t = min{i : \Vi\ < I}. Then since 

| V(+1 | - m -Mz>±l < Lzi m + 1 
<(^)W* 

it holds that t = [log//(/_!) N~\ = 0(n). Let w(x) be an canonical witness 
for x and define Si by 

:= {(x,w(x)) : x G Qi) for i < t, 
S t := {(x,w(x)) : xGVt). 

Finally let h(n) = S = (Si,... ,S t ). Then \S\ = 0{ln 2 ). Now we can 
construct an algorithm which computes a witness for x with the help of S. 

begin 

if x occurs in S then output w{x) 
else 

for j := 1 to t — 1 do in parallel 

if Qj helps x then u)j := w(Qj,x) 
endfor 

output iUj with minimal j < t 
endif 
end. 

Since this algorithm is in AC we are done. □ 
Corollary 1. ^(AC ) is false. 

Proof. This follows from the fact that the parity function is not in AC /poly, 
since Hastad's proof || that parity cannot be computed by constant depth 
polynomial size circuits can be applied to nonuniform case. □ 

Lemma 1. For any R(x,y) G AC , L\ h 3yVzRy +1 '(a, y,z). 

Proof. Let R G AC . Then clearly it is esb in L\. So by epE|-L( i+1 ) MAX 
principle in L\ there exists a maximal t such that 

3x < aR(a, x) —>■ 3x < a(\x\i + i = t A R(a, x)). 

Thus L % 2 proves 

3x < aR(a,x) — ► 3x < aVy < aR(a,x) A (|x|j + i = — > ~^R(a,y)). 
which is logically equivalent to our assertion. □ 

Now we conclude that AC°CA and L\ are distinct. 
Theorem 7. L| ^ s n °t a conservative extension of AC°CA. 
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Proof. Let R(x,y) e AC and suppose AC°CA = L\. Then 

AC°CA h 3yVzi?( i+1 ) (a, y, z) 

and as AC°CA is an universal theory, we may apply Herbrand's theorem to 
get a finite number of functions fx, . . . , f k such that 

R^ia, h(a), 6 1 )V J R( i+1 )(a, f 2 (a, b x ), 6 2 )V- • -VR^ +1 \a, f k (a, h,..., b k ^), b k ) 

holds. Therefore ^' l+1 ' > (AC ) holds and this contradicts to Corollary |l]. 
Hence AC°CA / L\. □ 

7. Concluding Remarks 

The system L\ has a slightly complicated definition: it has sbY\-L l+l IND 
as an induction axiom. One might ask what if we replace it with simply 
T\-L t+1 IND. I other words, what is the computational complexity of 
L' l+l IND. Or, more generally, the problem can be stated as follows: 

Problem 1. Specify the complexity of T> b k - definable functions of AC°CA + 
Y, b k - LUND. 
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